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Abstract. We present an algorithm to convert a word of length n in 
the standard generators of the solvable Baumslag-Solitar group BS{1, p) 
into a geodesic word, which runs in linear time and space on a random 
access machine. 



1. Introduction 

Recently Miasnikov, Roman'kov, Ushakov and Vershik [5] proved that for 
free metabelian groups with standard generating sets, the following problem 
is NP-complete: 

• given a word in the generators and an integer fc, decide whether the 
geodesic length of the word is less than k. 
They call this the bounded geodesic length problem for a group G with finite 
generating set Q. It follows that given a word, computing its geodesic length, 
and finding an explicit geodesic representative for it, are NP-hard problems. 
These problems are referred to as the geodesic length problem and the geodesic 
problem respectively. 

In this article we consider the same problems for a different class of metabelian 
groups, the well known Baumslag-Solitar groups, with presentations 

{a,t I tat^^ = qP) 

for any integer p > 2. We give a deterministic algorithm which takes as input 
a word in the generators a^^,t^^ of length n, and outputs a geodesic word 
representing the same group element, in time 0{n). Consequently, the three 
problems are solvable in linear timeQ. 

In an unpublished preprint ^6, Miller gives an effective procedure to convert 
a word in the above group to a geodesic of the form t~^zt~"^ where z belongs 
to a regular language over the alphabet {a, a~^,t}. Miller's algorithm as it is 
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given in his (unpublished) preprint would take exponential time worst case. 
The algorithm presented here closely follows Miller's procedure with some 
modifications (using pointers in part one and a careful tracking procedure in 
part two) to ensure linear time and space. We use as our computational model 
a random access machine, which allows us to access (read, write and delete) 
any specified location in an array in constant time. 

Recent work of Diekert and Laun 1 extends the result of this paper to 
groups of the form (a, t \ ta^t^^ = a'?) when p divides q. Their algorithm runs 
in quadratic time, but in the case p — 1 the time reduces to linear. 

The author wishes to thank Sasha Ushakov and Alexei Miasnikov for sug- 
gesting this problem, and Sasha, Alexei, Alex Myasnikov, Igor Lysionok, An- 
drew Rechnitzer and Yves Stalder for many helpful suggestions. The author 
thanks the anonymous reviewer for their careful reading and instructive com- 
ments and corrections. The author gratefully acknowledges the support of the 
Algebraic Cryptography Center at Stevens Institute of Technology. 

2. Preliminaries 

Fix Gp to be the Baumslag-Solitar group (a, t \ t~^at = a^) for some p > 2. 
We will call a single relator a brick, with sides labeled by t edges, as in Figure 
[H The Cay ley graph can be obtained by gluing together these bricks. We call 
a sheet a subset of the Cayley graph made by laying rows of bricks atop each 
other to make a plane, also shown in Figure [1] The complete Cayley graph is 
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Figure 1 . Parts of the Cayley graph of G3 

obtained by gluing these sheets together so that every vertex has degree four. 
From side-on the Cayley graph looks like a rooted p-ary tree. We orient each 
a edge to the right and each t each going up. Some nice pictures can be found 
in [3] pp. 155-160. 
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A word in the generators a^^ ,t^^ is said to be of the form P if it contains 
no t^^ letters and at least one t letter, and of the form N if it contains no 
t's and at least one t~^. Then a word is of the form PNP say, if it is the 
concatenation of three words of the form P, N, P in that order. The t-exponent 
sum of a word is the number of t letters minus the number of letters. We 
write =G when two words represent the same element in the group and = 
when they are identical as strings, and £{w) is the number of letters in the 
string w. 

The following simple lemma and corollaries come from [5] and [1]. 

Lemma 1 (Commutation). Ifu,v have t-exponent sum zero then uv =g vu. 

Corollary 2 (Geodesies). A geodesic cannot contain a subword of the form 
NPNP or PNPN. 

Corollary 3 (Pushing as). IJ w is of type NP and has t-exponent zero then 
w —G u — t^^up where up is of type P and t-exponent k, and i{u) < £{w). 
If w is of type PN and t-exponent zero then w ~g ^ — vpt^^ where vp is of 
type P and t-exponent k, and £(v) < £{w). 

The two corollaries are simply a matter of commuting subwords of t- 
exponent zero past each other. We will show how this can be done in linear 
time and space in the algorithm. The trick is to use pointers, which we will 
explain in Section [4] below. 

Here is an outline of the algorithm. We start with an input word of length n, 
which we write onto a tape of length ?i, freely reducing as we write. Scanning 
through the word and keeping track of the i-exponent we can determine the 
total t-exponent sum of the input word. If it is negative, write the inverse of 
the word on a new tape (which takes linear time), and remember to output 
the inverse of the final result at the end. So without loss of generality our 
input word has non-negative i-exponent sum. 

The first part of the algorithm transforms the input word into a word of 
the form u = t~^upt~'^ , as Miller does, which is equal in the group to the 
input word, where the word up — a'^°t . . . ta'^'' with q > k + m, \ei\ < p for 
i = 0, . . . , q — 1 and \eq\ < 3p. This word is not necessarily geodesic but is 
close. In part two of the algorithm we prove that u is (almost) fellow traveled 
by a geodesic for it, which lives in the same sheet of the Cayley graph, and 
because of this fact we are able to trace along u and find a geodesic for it in 
linear time and space. 

3. Data structure and subroutines 

We will make use of the following data structure and subroutines in part 
one of the algorithm. Assume the input word has length n. 

Construct a list we will call List A of n -f 2 5-tuples, which we view as an 
5 X (n -|- 2) table. Each address in the table will contain either a blank symbol. 
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an integer (between — n and n + 1), or the symbol t,t ^,a,a or F. We 
refer to an address by the ordered pair (row,column). 

• Write the numbers to n + 1 in the first row. These entries will stay 
fixed throughout the algorithm. 

• Row 2 will store the input word. Write S for start at address (2,0), 
then the input word letter by letter in addresses (2, 1) to (2,n), and 
at address (2,n + 1) write F for finish. As the algorithm progresses, 
these entries will either remain in their original positions, or be erased 
(and replaced by a blank symbol). S and F are never erased. 

• Row 3 will contain no entries at the beginning. As the algorithm pro- 
gresses we will use the addresses in this row to store integers (between 
—n to n). 

• Write the numbers 1 through n + 1 in the first n + 1 addresses of row 
4. Leave the final address blank. This row will act as a pointer to 
the next column address to be read. As the algorithm progresses, the 
entries in this row may change. 

• In row 5, write a blank symbol in the first address, then write the 
numbers through n in the remaining addresses. This row indicates 
the previous column address that was read. 

Here is List A in its initial state, with input word at^a. . . at~^. 
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As the algorithm progresses, we will "reorder" the word written in row 2 using 
the pointers in rows 4 and 5 (and leaving the letters in row 2 fixed, possibly 
erasing some) . To read the word, start at the S symbol. Move to the column 
address indicated in row 4. At the beginning this will be column 1. From 
the current column read the entry in row 4 to move the the next column. 
Continue until you reach the F symbol. At any stage, to step back to the 
previous address, go to the cohimn address indicated by row 5. Throughout 
the algorithm, the pointers will never point to or from a column which has a 
blank symbol in row 2. The pointers allow us to rearrange and delete letters 
from the word in row 2 efficiently (in constant time), without having to move 
any letters on the table. 

For convenience, we indicate the current address being read by a cursor. 
We assume that moving the cursor from one position in the list to another 
takes constant time on a random access machine. 



A LINEAR-TIME ALGORITHM TO COMPUTE GEODESICS 5 

Here are two subroutines that we will use many times. Each one takes 
constant time to call. Assume that the cursor is pointing to column k, and 
that the entry in the address (2, k) is not blank. 
Subroutine 1: Free reduction 

Read the entries in rows 2,4 and 5 of column k. Say the letter in row 2 is 
X, and the integers in rows 4 and 5 are 

If position j row 2 is x~^, then we can cancel this pair of generators from 
the word as follows: 

• read the integer in row 5 position j, and go to the address indicated 
(say it is r). In row 4 of this address, write i. In row 5 position i, 
write r. 

• erase entries in columns j and k 

• go to position i. 

In this way, we have deleted x~^x from the word, and adjusted the pointers 
so that they skip these positions. 
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Else, if position i row 2 is x~^, we perform a similar operation to erase 
xx^^ from the word, adjusting pointers appropriately. 

Assuming that we can access positions using the pointers in constant time 
(that is, we have a random access machine), then this procedure takes constant 
time to run. 

The next subroutine eliminates the occurrence of subwords a^^^ (where 
p > 2 is fixed). Again, assume the pointer is at position k and the entry at 
address (2,fc) is not blank. 
Subroutine 2: Consecutive as 

• If the letter at this address is o, set a counter acount = 1. Move back 
one square (using pointer in row 5) to column j. If address (2,j) is 
a, increment acount. Repeat until acount = 3p or the next letter is 
not a. Note maximum number of steps is 3p (constant). 
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If acount — 3p and you are at column i, write ta'H^^ over the first 5 
as, and blank symbols in the remaining 3p — 5 addresses up to position 
k. Adjust the pointers so that the pointer at the added points to 
the value indicated at (4,fc), and write the appropriate value in row 5 
of that position. 

• If the letter at this position is a~^, do the same with instead of 
a. 

This procedure takes constant time, and if it is successful (that is, replaces 
^±3p ^-1qj±3^^ strictly reduces the length of the word. 

4. Algorithm part 1 
Assume the input word has length n. 

Step 1: Write the input word in freely reduced form on List A as follows. 
Read the first letter and write it in address (1,2) of List A. For each subsequent 
letter if it freely cancels with the previous letter in row 2, erase the previous 
letter and continue. At the same time record the successive t-exponent sum 
of the word by incrementing and decrementing a counter each time a t^^ is 
read. If the final f-exponent sum is negative, rewrite the inverse of the word 
on a new List A. At the end of the algorithm, remember to output the inverse 
of the final output. 

So the word in row 2 of the tape is freely reduced and has nonnegative 
f-exponent. Fill in rows 1,4 and 5 of List A with the column numbers and 
pointers set to the initial state. 

Step 2: Move cursor to column 0, then take 3p — 1 steps to the right to 
column k. Assume the entire word in row 2 is freely reduced, and the word 
up to column k contains no more than 3p — 1 consecutive as or a~^s. Go to 
the next column (indicated by row 4), and perform Subroutine 2. 

If the subroutine finds a^^^, then with the cursor at the each end of the 
inserted word (of length 5), perform Subroutine 1. Repeat until Subroutine 
1 finds no more canceling pairs. Then move back to the next position after k 
and continue applying Subroutine 2. 

At the end of this procedure, the entire word is freely reduced and has no 
Q,±3p subwords. The number of times Subroutine 2 is performed is at most 
n, and the total number of times we perform Subroutine 1 is 0{n) since each 
time it is successful the word reduces length, so it is successful at most n 
times and unsuccessful at most twice (for each end) after each application of 
Subroutine 2. 

So wc now have a freely reduced word with less than 3p a^^ letters in 
succession, in row 2 of List A. The pointers in row 4 still point to columns to 
the right, since we have not commuted any subwords yet. 
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Step 3: Construct a second list we call List B of 2n + 1 4-tuples, which we 
view as a 4 X (2n + 1) table. In the first row write the integers from —n to n. 

Starting at column of List A, set a counter texp=0. Reading the word in 
row 2 from left to right, if in column k you read a t*^ letter, add ±1 to texp, 
and write the value of texp at address (3, k) of List A. In List B, if address 
(2, texp) is blank, write k. If it contains a value, write k in address (3, texp) 
if it is blank, otherwise in address (4, texp). 

In other words, each time you read a , write the current t-exponent sum 
underneath it, and in List B keep a record of how many times this t-exponent 
has appeared (which we call the number of strikes for that exponent) and at 
which positions in List A it appeared. 

Here we show List A for the input word at^atat^^at^^at^^ata . . ., and the 
corresponding List B, as an example. 
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When an entry occurs in the last row of List B at some position labeled 
texp, meaning some exponent has has "3 strikes", then we have a prefix 
of the form either NPNP or PNPN, so we apply Corollary [2] as follows. 
Suppose the entries in this column are Pa,Pb,P3, with the most recently 
added. These correspond to the positions in List A where the value texp have 
appeared. 

To begin with, the word written in row 2 of List A appears in the correct 
order (from left to right), the pointers have only been used to possibly skip 
blank addresses. So at the start of this step we know that pa comes before pb- 
As the algorithm progresses, we will not know which of pa and pi, comes first 
in the word. That is, as we introduce pointers to List A to move subwords 
around, a letter in column p could sit before a letter in column q with q < p. 
We do know that ps is the right-most position. 

The word read in its current order is either . . . pa ■ ■ ■ pt ■ ■ ■ P3 ■ ■ ■ or 
. . .ph . . .Pa ■ ■ - Ps ■ ■ ■■ We can determine the order with the following subrou- 
tine. 

Subroutine 3: Determine order oi pa,Pb- 
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Starting at pa, scan back (using pointers in row 5) through the word to the 
position of the previous t^^ letter, or the S symboL Since we have at most 
3p — 1 consecutive a^^ letters, this takes constant time. Do the same for pt,. 

If we come to S from either pa or ph, then we know that this position must 
come first. 

If both Pa,Pb are preceded by t^^ letters, then we need more information. 
Start at pa and scan forward to the first t^^ , whose position we call qa- Start at 
Pb and scan forward to the first t^^ , call this position qi,. This takes constant 
time since there are at most 3p — 1 consecutive a^^ letters. 

Now one of qa,qb must contain a t^-^ in row 2, with sign opposite to that 
of P3. 

• If qa is same sign as ps , then order must be Pa — <la — Pb — lb — P3 

• If qb is same sign as p3, then order must be Pb — qb — Pa ~ la — Ps 

• Both Qa, qb have opposite sign to p^. In this case, we look at the letters 
in row 2 positions Pa,Pb- If Pa has opposite sign to p^, then it must 
come first, since one oi Pa,Pb must match up with qa,qb- 

If both Pa,Pb have same letter as p^ in row 2, then we are in a 
situation like tat^^atat^^at. But since there is a t^^ letter preceding 
both Pa and pb, then the t-exponent before Pa,Pb,P'i are read is the 
same, and is recorded three times. This case cannot arise since we 
apply this procedure the first time we see the same number more than 
twice. 

Using this subroutine we can determine the correct order of the positions 
Pa,Pb and qa,qb, in 0(n) time. Rename the first position pi and second p2, 
and qi , (j2 as appropriate. So we have pi — qi — P2 — <l2 — Ps 

The subword between positions qi and p2 has t-exponent 0, as does the 
subword from q2 to p^. By commuting one of these subwords (using Lemma 
[Ij we can place a t next to a t^^ somewhere and get a free cancellation. The 
precise instruction will depend on the letters at each of these positions, and 
we will consider each situation case-by-case. 
Case 1: 
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Between pi and qi we have only a letters (or nothing). So we will commute 
the subword qi — p2 back towards pi as follows: 

• ji row 4, replace pi by Z2 

• 12 row 5, replace qi by ji 
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• p2 row 4, replace is by ii 

• i\ row 5, replace pi by 

• j2 row 4, replace qi by 

• is row 5, replace p^ by j2 

• delete columns pi , gi 

• delete pi and from List B columns fci and k2 respectively. 

This has the effect of moving the subword back through the word, but without 
changing more that a constant number of entries in the lists (with random 
access). 

The next case applies to our running example shown above. 
Case 2: 
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This time we will commute the subword q2 — P3 back past the subword of 
t-exponent zero and next to pi as follows: 

• ji row 4, replace pi by Z2 

• 12 row 5, replace pi by ji 

• P3 row 4, replace is by ii 

• ii row 5, replace pi by ps 

• 72 row 4, replace q2 by 

• is row 5, replace ps by j2 

• delete columns qi,P2 

• delete pi and q2 from List B columns ki and k2 respectively. 

Below wo show the two lists after commuting and deleting tt~^ for our 
running example after this step. We read the new word off List A following 
the pointers as a at tatat~^at~^a at~^ 
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The reniaining cases are similar and we leave it to the reader to imagine 
the instructions for each one. Corollary [2] guarantees that some commutation 
will reduce length in each case. 

With the cursor at position or if blank, the non-blank letter to its right, 
perform Subroutine 1 until unsuccessful, then Subroutine 2 until unsuccessful, 
and alternately until both are unsuccessful. Since each successful application 
of a subroutine reduces word length, the total number of successes of each is n 
throughout the whole algorithm. Once both are unsuccessful the entire word 
is again freely reduced and avoids a^^P, and the cursor is at the next non-blank 
position to the right of pa. We then resume Step 3 from this position. 

So after performing this procedure, List A contains a possibly shorter word 
in row 2, which is read starting at column and following pointers, and List 
B contains the correct data of ^-exponents and positions (although positions 
don't stay in order). Since we removed one of the 3 strikes, we start a.t p^ + l 
and continue filling out row 3 of List A, adding appropriate entries to List 
B, until we again get 3 strikes. Note that we do not backtrack, so the total 
number of right steps taken in Step 3 (assuming the random access model of 
computation allows us to read and write at any specified position in the table) 
is 0{n). The number of times we need to apply the subroutines (successfully 
and unsuccessfully) is also 0{n) regardless of how many times Step 3 is called, 
so so all together this step takes 0{n) time. 

At the end of this step, since the word has nonnegative i-exponent sum, 
and all "3 strikes" have been eliminated, the word must be of the form E, P, 
PN, NP, NPN, or PNP. 

Step 4: If the word at this stage is of the form PNP, its t-exponent sum 
must be zero, so it has a prefix of the form PN and suffix NP, both of zero 
i-exponent sum. Say pi,p2 are the two positions that the i-exponent is zero. 
We commute the prefix and suffix by rewriting pointers. If the word on List 
A is not written in order from left to right, we can create a new List A in 
which the word is in correct order, by reading the current list following the 
pointers. 
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So after this we can assume the configuration of List A is as follows: 
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Then do the following: 

• row 4 replace ii hy P2 

• P2 row 5 replace by 

• row 4 replace n + 1 by pi 

• pi row 5 replace by js 

• ji row 4 replace p2 by n + 1 

• n + 1 row 5 replace js by j\ . 

The word is now the form NPN . 

Step 5: At this point the word in List A row 2 is of the form E, P, PN, NP 
or NPN. We can ascertain which of these it is in constant time simply by 
checking the first and last t^^ letter in the word, which lie at most 3p steps 
from the ends of the tape (positions S and F), following pointers. 

• No t^^ letters: E 

• First t last t: P 

• First t last i"^: PN 

• First t-'^ last t: NP 

• First t-^ last f-^: NPN 

In the case E, the word is a* with \i\ < 3p, so by checking a finite list we 
can find a geodesic for it and be done. So for the rest of the algorithm assume 
u is of the form P, PN, NP, NPN. 

In the last two cases NP and NPN it is possible the word contains a 
subword of the form t^^a^Pt for an integer x, which will be in {±1,±2}. If 
so we want to replace it by , which will always reduce length. This is 
easily done in constant time, assuming the cursor is pointing to the column 
containing the first t letter in the word. From this column scan back at most 
3p letters to a t^^ letter and count the number of a*^ letters in between. 
Then if you find t^^a:''Pt rewrite with and then move forward to the next t 
letter. Repeat this at most 0{n) times (the maximum number of t letters in 
the word) until no such subword appears. Note that to locate the first t letter 
in the word to start this takes 0(n) time to scan the word, but you only need 
to do this once. 

We now have a word in one of the forms P, PN, NP, NPN which contains 
no t^^a^^t subword. 
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Step 6: In this step we apply Corollary [3] to push all of the a^^ letters in 
the word occur in a single sheet of the Cayley graph. The output of this step 
(the word in row 2 of List A) will be a word of the form t~^upt~^ where up 
is a word of type P with t-exponent > fc + Z, and fc, Z > and the word still 
does not contain a subword of the form t^^a^^t. 
Case P: The word is of the form up so done. 

Case PN: Say texp is the final i-exponent of the word, which occurs at 
positions pi and qi. If qi is not the end of the word (that is, there are a^^ 
letters at the end of the word) , then we want to push the a letters there back 
through the word to t-exponent 0, which starts after pi. The configuration of 
the tape is as follows (where we assume js ^ qi since there are a^^ letters at 
the end of the word): 
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Then do the following: 

• qi row 4, replace 12 with n + 1 

• n + 1 row 5, replace ja with qi 

• pi row 4, replace ii with 12 

• 12 row 5, replace qi with pi 

• row 4, replace n + 1 with ii 

• ii row 5, replace pi with 

So we have commuted the word a^™ at the end, through the subword of 
t-exponent 0. Check for cancellation of a^^a^^, if this occurs then cancel. 
Repeat up to 3p — 1 (constant) times. 

Next, let p2, q2 be the positions that (texp + 1) occurs. Repeat the proce- 
dure at this level. Again freely cancel. 

Iterate this until all a^^ letters are pushed into the middle of the word, so 
the resulting word is of the form upt~^ where up is a word of type P with 
t-exponent at least k. Note that at the top level there is only one a' subword, 
which will not be canceled, so there is no free cancellation of t letters in this 
step. At every other level there can be at most &p — 2 consecutive a^^ letters. 
Case NP: Same as previous case, this time pushing as to the right. 
Case NPN: Break the word into NP and PN subwords, with the NP 
subword ending with t, and each of zero t-exponent sum, and perform the 
above steps to push a letters to the right and left respectively, then ensure 
there is no t~^a'^Pt subword. In the NP prefix the maximum number of 
consecutive a^^ letters is 6p — 2 at every level except the top level, since we 
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cut the word immediately after at, so at this level we have at most 3p — 1 
consecutive o^^s, and in the PN suffix we can have at most 6p — 2 at every 
level, so all together there could be at most 9p — 3 consecutive a^^s. 

So after this step, the positive part of the word, up, stays within a single 
sheet of the Cayley graph. We write 

Up = a^Ha"' . . . a'^'^-Ha"'" 

where each |ei| < 9p and with cq not a multiple of p when the word is of the 
form NP or NPN. 

Step 7: In this step we remove all occurrences of a^^t in the word. Scan to 
the first t in row 2. If the preceding p letters are a^^ then replace a^^t by 
ta^^. Stay at this t letter and repeat until there is no a^^t, then move to the 
next t letter. Since each replacement reduces length the time for this step is 
linear. At the end you have eliminated all a^^t subwords so the word is of 
the form u = t~''upt~^ = t~''a'^°t . . . ta'^'"t~^ with |ei| <p for i <m. 

Step 8: Set = Mq. If |Mo| < 3p then stop, part one of the algorithm is 
done. If |Mo| > 'ip then we will replace the last term a*^™ = , by a word 
of the form a^^t . . . ta^'t^^ with \rji\ < p for i < s and |?7s| < 3p, as follows. 

• Go to the first after a^'° on the tape, then scan back 3p steps. If 
you read a^^^" in these steps, then replace the subword by ta^^t~^, 
which strictly reduces length. Then scan back another p steps from 
the t letter you have inserted, and if you read a^^t then replace it by 
ta^^. Repeat until you don't read p consecutive as or a~^s. 

If you did any replacing, you now have a word of the form u = 
t-^a'^^t . . . a^^^-^taPHa'^^H-H-^ with |ei| < p for i < m and |?7o| < p. 
The number of steps to do this is 0(|Mo|). Note that \Mi\ < |Mo|/p. 

• Repeat the previous step, by scanning back 3p from the last fr^ in- 
serted. If you read a^^^ replace and repeat the procedure as be- 
fore, and if |Mj| < 3p stop. Note that each \Mi\ < |Mi_i|/p, so 
|M,| =< |Mo|/(p*). 

Each iteration of this takes 0(|Mj|) = 0{\Mq\/{p^)) steps, so in total the time 
for this procedure is 

0(|Mo| + |Mo|/p+ |Mo|/p2 + ...)= 0(|Mo|) = 0{n) 

by the geometric series formula. 

So the word on the tape is now of the form u = t~^a^°t . . .a^'t"™ where 
< |eo| < p if fc > 0, |ej| < p for < i < g and |eq| < 3p. 

We summarise part one of the algorithm in the following lemma. 

Lemma 4. Any word w G Gp can be converted to a word u =g w of the form 

u = t-''a'H...ta'H-"' 
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with £{u) < i{w), k,m,,q > 0, q > k + m,0 < |eo| < p if k > 0, \ei\ < p for 
i = 0, . . .q— 1 and | Cq | < 3p, and moreover this can be achieved in linear time 
and space in the length of w. 

5. Part two of the algorithm 

The key to the final part of the algorithm are the following two lemmas. 
Roughly, they say that now that we have put the input word into a nice 
enough form, a geodesic for it can be found which (almost) fellow travels it 
in the same sheets of the Cayley graph, and so a geodesic can be obtained by 
tracking our current word in linear time and space. 

Lemma 5. Let 

u = t-''a'H...ta'H-"' 

with q > k + m, k,m > 0, eo ^ if k > 0, jcj | < p for i = 0, . . .q — 1 and 
I eg I < 3p. Then there is a geodesic v for u of the form 

v = t-''a'^H...ta'^n-' 

with k,r,s > 0, r]o ^ if k > 0, q — m = r — s, \r]i\ < p for i = 0, . . .r — 1 and 

\er\ < 3p. 

Proof. Let v be some geodesic for u, then applying part one of the algorithm 
we can put v into the form t~^ a'^°t . . .ta^^t~^ for some integers j,r,s, and 
ryo 7^ if J > 0, \r]i\ < p for i ~ 0, . . .r ~ 1 and \€r\ < 3p, otherwise v is not 
geodesic, and q — k — m = r — j — s>0 since u =c v. 

Apply the substitutions ta'f~^ = o'^ to u m times to get u' = t~'^u\a'^ 
where ui is a prefix of a'^°t . . . ta'^i ending in i, of t-exponcnt sum q — m, and 
c some integer. Do the same to v to get v' = t~^v\a'^ where v\ is a prefix of 
a'^^t . . .ta^'n. 

Since u =c v 1 =g v~^u =g a~'^v^'^P~^uia'^ which is of the form NP. By 
Britton's Lemma this word contains either ta^t~^ or t~^a^Pt, but since it is of 
type NP the subword must be t~^a^^t. U j > k this means jjo is a multiple 
of p, and if k > j then eo is a multiple of p, either way a contradiction. So 
k = j. 

This means that u and v both have the form t~''PN. □ 

We now complete the algorithm by proving that we can find such a u in 
the same slicx^t as u in linear time and space in the length of u. For case of 
exposition we divide this into two cases. We first consider the case that u is 
of the form P or NP. 

Lemma 6. Let u he as in the previous lemma with m = 0, that is, u = 

t~^a'^°t . . .ta'^''. Then v can be found, as in the previous lemm,a, so that u,v 
asynchronously 3p-fellow travel up to level q — m, in linear time and space in 
the length of u. 
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Proof. We draw a neighborhood of u in the Cayley graph as follows. Start 
at the endpoint of t~'', and label this point S. Draw a horizontal line of |eo| 
a edges, left if < and right if positive. Then draw a vertical t edge up 
from this line, and complete the picture by drawing in the brick containing 
a'^°t on its boundary. Label the corner corresponding to the endpoint of a'^°t 
by U, and on each corner compute the distance di,d2 back to S (which will 
be |eo| + 1 and p — |eo| + 1). If |rfi — c?2| < 1 then keep both labels, and 
store two words 51,52 which are geodesies to these points. If their difference 
is greater than 1 then discard the larger label and only keep the short one, 
plus a geodesic word gi to it. See Figure [2l 



o o 

o o o o o 

s 

Figure 2. First level. In these figures we are in the group G4. 

Now assume you have drawn this picture up to level j < g, so you have 
i bricks stacked on top of each other vertically, and the top brick has its 
top corner(s) labeled U corresponding to the endpoint of a'^°t. . .a'^'-^t, and 
di, ((^2) the shortest distance(s) back to S. Also you have stored geodesic(s) 
gi, (92) to the points labeled di,d2. 

From the point U, draw a horizontal line for a*^' to the left or right de- 
pending on the sign. Then draw a vertical t edge up. Now since je^l < p, the 
brick with boundary a'^H also contains the point (s) di, (^2)1 and so to compute 
the distance to the corners of the new brick, one simply computes from these 
points, since they are the closest points on the level i in this sheet. So label 
the corners of the new brick in level i + 1 by U,di, (^2) as before. Update 
gi, (52) by appending su0ix(es) aH. 

In this way one can draw the path u in its sheet up to level q, and keep 
track of the distances from S to each level of the sheet, using constant time 
and space for each level. Figure [5] shows the next two iterations of this. 

At level q, draw a*^' from U to the endpoint of u, which we mark with E. 
Now, a geodesic v has the form t~^viV2t~'' by the previous lemma, where 
Vi, V2 are both type P, and Vi has i-exponent q, and V2 exponent s > 0. So Vi 
is a geodesic from the point S to some point on level q of type P, so without 
loss of generality it is one of (?i, (?2 followed by some word a''. So a geodesic 
for u is of the form 

v = r''g^aPHaPH...taPT' 
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Figure 3. The next two levels 



for a; = 1,2 and s > 0, and again \pi\ < p for i < s and < < 3p. 
Now a''°ta''^t . . . ta'''t~'^ has length > 2s + 1 is a geodesic from dx to E and 
this distance is at most 3p since d{E,U) < 3p and d{U,dx) < 1. Therefore 
s < (3p — l)/2 is bounded, and so finding f is a matter of checking a finite 
number of possible suffixes, which can be done in constant time, so we are 
done. (Note that a much sharper bound could be obtained, but s < {3p— 1)/2 



1? 



di d2 
U 



Figure 4. The top level 

suffices to complete the proof). The final step then also takes constant time 

and space so all together we have used 0{q) time and space with q < n. 

It follows that this v 3p-fellow travels u asynchronously, since at each level 
U is within 1 of di, ^2, and so any point on u is contained in the same brick as 
some part of v up to level q, and at level q the suffix of v of length < (3p — 1) /2 
and the suffix of u of length < 3p, both paths from U to E, mean that any 
point on one is within 3p of the other. □ 

Finally we must deal with the cases when when u is type PN and NPN. 
The difference here is that the word v may not go up as high as u. As 
an instructive example, suppose u = (a^~Pt)"at~" {p > 2), which is in the 
form out the output of part one of the algorithm. This word has geodesic 



A LINEAR-TIME ALGORITHM TO COMPUTE GEODESICS 



17 



representative a, and so the geodesic for it no longer fellow travels it. In spite 
of this we have the following. 

Lemma 7. Let u he as in the previous lemma with m > 0, so this time, 
u — t~*'a'^'-'t . . .ta'^''t~"^ Then v can he found, as in the previous lemma, so 
that u, V asynchronously 3p-fellow travel up to level q — m in the sheet, in 
linear time and space in the length of u. 

Proof. Compute q — m = c, and write 

u = t-''a'H . . . ta'-Ha'^H. . . ta^'-t-"" 

(this can be done on the tape from part one, just find the column where the 
t-exponent is the same as the final t-exponent). Repeat the procedure from 
the previous lemma, up to level c. So we have a line at level c with points 
marked U, di, (c?2). Now the geodesic v must come up to this level, and then 
end with a word of the form a* or V2t~^ with V2 of type P. Note that |z| < 3p 
or V is not geodesic. So extend the line at level c by 3p — 1 edges to the right 
and left of the point (s) di,d2. See Figure [5l Read a''^ along the line from 



di d2 
U 

Figure 5. Level c, with 3p — 1 — 11 edges on either side of 
di,d2 added. 

the point U, draw vertical t edge up, and cover the line with bricks as before. 
Store this data for the remainder of the procedure, since it may turn out that 
the shortest path runs along this line to the endpoint. 




di d2 
U 



Figure 6. Covering level c with bricks. Here rji = 0. 

Carry on as we did before, reading u and drawing bricks at each level, but 
this time extend out each line by 3p — 1 a edges in each direction from di, (^2). 
Continue until level q = m + c. To store this diagram of bricks takes 0{n) 
space, since for each level we are keeping a constant amount of data (at most 
7 bricks at each level). 

Read a'^i = a''™ and mark this point by Eq. As before, if v extends above 
this level, then it can go only a bounded number of levels more, so draw these 
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layers of bricks in, aligned with the point Eq. So a geodesic to Eq will be one 
of a finite number of paths, as before. Choose a shortest path to Eq, append 
t~™' to it, and store it with its length. 

Consider the path i"™ from Eq to E. Suppose each point is labeled Ei, 
with Era = E. If a geodesic v for u does not travel up to level q, then it must 
travel to some level between q and c, via a point Ei. So for each Ei we will 
find and record a shortest path which travels to di, (1^2) on that level, then 
across at most 3p ~ 1 a-edges to Ei then down to E by edges. We find 
these paths as follows. 

Remove the layer of bricks which contain the point from the stored 

diagram. If the point (s) di, (^2) on the level containing Ei is more than 3p 
edge(s) away then you have disconnected the diagram, and we know that a 
geodesic to E via Ei is not possible, since it would contain a subword of the 
form a^^P which is not geodesic. 

Otherwise record the path t~'^gx followed by a power of a^^ then edges, 
together with its length. 

Do this for each level until you disconnect the diagram or reach the point 
E. Note that each level takes a constant amount of time and space, and once 
the diagram becomes disconnected, a lower level would not be connected again 
by the geometry of the sheet. 

Therefore when you are done, search through the list of lengths recorded 
and choose a shortest, and output the geodesic path associated with it. □ 
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